A Unit Selection-based Speech Synthesis Approach for Mandarin Chinese
نویسندگان
چکیده
The paper presents a unit selection-based speech synthesis approach for mandarin Chinese. Unit selection-based approach generates speech by selecting proper units from a speech corpus and connecting them together. In this approach, a set of features are defined to describe the speech units in the corpus and the expected units in the synthesized utterance. Based on the features, cost function is defined to select a sequence of units that are able to generate high quality speech. The cost function describes the two aspects of the generated speech, ie. (1) the appropriateness level of each unit itself. (2) the smoothness level between two units to be concatenated. Viterbi search algorithm is used in this approach to find the best unit sequence that minimizes the two costs. The authors use a new prosody description to ensure the prosody quality of the generated speech. Experiment shows that this approach can generate very high quality speech.
منابع مشابه
A Unit Selection-based Speech Synthesis Approach for Chinese Mandarin Text-to-Speech
The paper presents a unit selection-based speech synthesis approach for Chinese Mandarin. Unit selection-based approach generates speech by directly connecting pre-recorded speech units. In this approach, a corpus is used as a source unit inventory. A feature vector is defined to describe each unit. To generate speech, the feature vector of the target unit is first calculated. During synthesis,...
متن کاملStudy on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...
متن کاملIssues in Text-to-Speech Conversion for Mandarin
Research on text-to-speech (TTS) conversion for Mandarin Chinese is a much younger enterprise than comparable research for English or other European languages. Nonetheless, impressive progress has been made over the last couple of decades, and Mandarin Chinese systems now exist which approach, or in some ways even surpass in quality available systems for English. This article has two goals. The...
متن کاملA novel hybrid approach for Mandarin speech synthesis
The paper investigates a new method to solve concatenation problems of Mandarin speech synthesis which is based on the hybrid approach of HMM-based speech synthesis and unit selection. Unlike other works which use only boundary F0 errors as concatenation cost, a CART based F0 dependency model which considers much context information is trained to measure smoothness of F0. Instead of phoneme-siz...
متن کاملThe WISTON Text to Speech System for Blizzard Challenge 2010
The paper introduces the speech synthesis system developed by Institute of Automation, Chinese Academy of Sciences(CASIA) for Blizzard Challenge 2010. The large corpus based speech synthesis system, WISTON, was built to synthesize Mandarin speech. In this year, a new prosodic structure prediction model was used, which is more precise and compact than before. Furthermore, two kinds of syllable s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of Chinese Language and Computing
دوره 16 شماره
صفحات -
تاریخ انتشار 2006